A Statistical Language Modeling Approach to Lattice-Based Spoken Document Retrieval

نویسندگان

Tee Kiah Chia

Haizhou Li

Hwee Tou Ng

چکیده

Speech recognition transcripts are far from perfect; they are not of sufficient quality to be useful on their own for spoken document retrieval. This is especially the case for conversational speech. Recent efforts have tried to overcome this issue by using statistics from speech lattices instead of only the 1best transcripts; however, these efforts have invariably used the classical vector space retrieval model. This paper presents a novel approach to lattice-based spoken document retrieval using statistical language models: a statistical model is estimated for each document, and probabilities derived from the document models are directly used to measure relevance. Experimental results show that the lattice-based language modeling method outperforms both the language modeling retrieval method using only the 1-best transcripts, as well as a recently proposed lattice-based vector space retrieval method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language Modeling Approach for Retrieving Passages in Lecture Audio Data

Spoken Document Retrieval (SDR) is a promising technology for enhancing the utility of spoken materials. After the spoken documents have been transcribed by using a Large Vocabulary Continuous Speech Recognition (LVCSR) decoder, a text-based ad hoc retrieval method can be applied directly to the transcribed documents. However, recognition errors will significantly degrade the retrieval performa...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Reducing the effect of OOV query words by using morph-based spoken document retrieval

Morph-based spoken document retrieval uses morpheme-like subword units for both language modeling and as

متن کامل

Syllable-Based Chinese Text/Spoken Document Retrieval Using Text/Speech Queries

In order to solve the problem with the fast growth of Chinese information resources on the Internet, this paper deals with the problem of Chinese text and spoken document retrieval using both text and speech queries. By properly utilizing the monosyllabic structure of Chinese language, the proposed approach performs the statistical similarity estimation between the text/speech queries and the t...

متن کامل

A study of term weighting in phonotactic approach to spoken language recognition

In the spoken language recognition approach of modeling phonetic lattice with the Support Vector Machine (SVM), term weighting on the supervector of N-gram probabilities is critical to the recognition performance because the weighting prevents the SVM kernel from being dominated by a few large probabilities. We investigate several term weighting functions that are used in text retrieval, which ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

A Statistical Language Modeling Approach to Lattice-Based Spoken Document Retrieval

نویسندگان

چکیده

منابع مشابه

Language Modeling Approach for Retrieving Passages in Lecture Audio Data

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Reducing the effect of OOV query words by using morph-based spoken document retrieval

Syllable-Based Chinese Text/Spoken Document Retrieval Using Text/Speech Queries

A study of term weighting in phonotactic approach to spoken language recognition

عنوان ژورنال:

اشتراک گذاری